A two-phase pitch marking method for TD-PSOLA synthesis
Abstract
This paper describes a robust two-phase pitch marking method based on peak-valley decision and dynamic programming. In the first phase, we select either peaks or valleys as pitch mark candidates according to their similarity to an estimated pitch curve. In the second phase, we define state and transition probabilities, and then employ dynamic programming to find the most likely pitch marks. We also designed several tests to demonstrate the feasibility of the proposed approach.
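The two phases described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the paper's actual similarity measure or probability definitions, so everything specific here is an assumption made for illustration: a constant pitch period in samples stands in for the estimated pitch curve, the state log-probability is the relative height of a candidate, the transition log-probability is a Gaussian penalty on the deviation of the mark spacing from the period, and the names `extrema`, `spacing_error`, `pitch_marks`, and `sigma_ratio` are hypothetical.

```python
import math

def extrema(x, sign):
    # Local maxima of sign * x with positive height:
    # peaks for sign = +1, valleys for sign = -1.
    return [i for i in range(1, len(x) - 1)
            if sign * x[i] > 0
            and sign * x[i - 1] < sign * x[i] >= sign * x[i + 1]]

def spacing_error(x, sign, period):
    # Phase 1 helper (assumed similarity measure): take the strongest
    # extremum in each period-long frame and return the mean absolute
    # deviation of the resulting spacing from the estimated period.
    cands = extrema(x, sign)
    picks = []
    for start in range(0, len(x), period):
        frame = [i for i in cands if start <= i < start + period]
        if frame:
            picks.append(max(frame, key=lambda i: sign * x[i]))
    gaps = [b - a for a, b in zip(picks, picks[1:])]
    if not gaps:
        return math.inf
    return sum(abs(g - period) for g in gaps) / len(gaps)

def pitch_marks(x, period, sigma_ratio=0.1):
    # Phase 1: keep the polarity whose extrema best follow the pitch estimate.
    sign = min((+1, -1), key=lambda s: spacing_error(x, s, period))
    cands = extrema(x, sign)
    if not cands:
        return sign, []
    top = max(sign * x[i] for i in cands)
    # Assumed state log-probability: candidate height relative to the tallest.
    state = [math.log(sign * x[i] / top) for i in cands]
    sigma = sigma_ratio * period
    n = len(cands)
    # Phase 2: Viterbi-style dynamic programming over the candidates.
    # A path may only start within the first period of the signal.
    best = [state[j] if cands[j] < period else -math.inf for j in range(n)]
    back = [-1] * n
    for j in range(n):
        for k in range(j):
            gap = cands[j] - cands[k]
            if best[k] == -math.inf or not 0.5 * period <= gap <= 1.5 * period:
                continue
            # Assumed transition log-probability: Gaussian penalty on spacing.
            score = best[k] - (gap - period) ** 2 / (2 * sigma ** 2) + state[j]
            if score > best[j]:
                best[j], back[j] = score, k
    # A path must end within the last period of the signal.
    ends = [j for j in range(n)
            if cands[j] >= len(x) - period and best[j] > -math.inf]
    if not ends:
        return sign, []
    j = max(ends, key=lambda e: best[e])
    marks = []
    while j != -1:
        marks.append(cands[j])
        j = back[j]
    return sign, marks[::-1]

# Synthetic test signal: one clean peak per 50-sample period, two valleys per
# period whose relative depth alternates, and one spurious peak at sample 140.
period = 50
x = [0.0] * 300
for m in range(6):
    p = 20 + m * period
    x[p] = 1.0
    x[p + 15] = -0.4 if m % 2 == 0 else -0.3
    x[p + 25] = -0.3 if m % 2 == 0 else -0.4
x[140] = 0.6
sign, marks = pitch_marks(x, period)
print(sign, marks)
```

On this synthetic signal the valleys give irregular spacing, so the first phase keeps the peaks, and the dynamic programming step skips the spurious peak at sample 140 because reaching it would require spacings far from the period. A real implementation would replace the constant `period` with a time-varying pitch estimate and would need to handle unvoiced regions, neither of which this sketch attempts.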
Similar resources
Prosody Modification Using Allpass Residual of Speech Signals
In this paper, we attempt to highlight the role of the phase spectrum of speech signals in acquiring an accurate estimate of the excitation source for prosody modification. The phase spectrum is parametrically modeled as the response of an allpass (AP) filter, and the filter coefficients are estimated by considering the linear prediction (LP) residual as the output of the AP filter. The resultant residua...
Hybrid electroglottograph and speech signal based algorithm for pitch marking
Pitch marking is very significant in speech signal processing. In a text-to-speech (TTS) system based on the Time-Domain Pitch-Synchronous Overlap-Add (TD-PSOLA) method, robust estimation of pitch marks (PM) is especially important to the modification of the time and pitch scale of a speech signal in order to match it to that of the target speaker. The aim of this paper is to improve the accura...
Accurate pitch marking for prosodic modification of speech segments
This paper describes a new approach to pitch marking. Unlike other approaches that use the same combination of features for the whole signal, we take into account the signal properties and apply different features according to some heuristic. Basically we use a special type of energy contour for pitch marking. Where the energy information turns out to be not suitable as an indicator we resort t...
Pitch Marking Based on an Adaptable Filter and a Peak-Valley Estimation Method
In a text-to-speech (TTS) conversion system based on the time-domain pitch-synchronous overlap-add (TD-PSOLA) method, accurate estimation of pitch periods and pitch marks is necessary for pitch modification to ensure optimal quality of the synthetic speech. In general, there are two major issues in pitch marking: pitch detection and location determination. In this paper, an adaptable filter,...
A novel model TD-PSPTP for speech synthesis
In this paper, a novel approach based on the time-domain pitch-synchronous point-to-point (TD-PSPTP) model for speech synthesis is presented. Compared to TD-PSOLA, which is currently one of the most popular concatenation methods, the TD-PSPTP model provides a wider range of pitch and time modification. The quality of speech synthesized by TD-PSPTP is shown to be high, especially its capability of overcomin...